Sparse Covariance Coding

نویسندگان

  • Aaron Courville
  • Dumitru Erhan
  • Pascal Vincent
  • Yoshua Bengio
چکیده

Recently, there has been considerable interest in employing unsupervised learning methods as feature extractors for supervised learning tasks such as classification. The literature shows that methods based on this approach have proved to be competitive with established state-of-the-art machine learning strategies. One important recent advance was the discovery by (Hinton, Osindero, & Teh, 2006) of the role that unsupervised learning methods can play as an initialization for subsequent training for a supervised task. Unsupervised learning methods that extract sparse features of the data have received particular attention and has been shown by Raina, Battle, Lee, Packer, and Ng (2007) to significantly improve classification performance. While there are some clear advantages of not including label information in learning the feature set or initialization — for instance, Raina et al. (2007) use a sparse coding scheme trained on a large quantity of “related” unlabeled data to augment the feature set training — the use of unsupervised feature extraction gives no safeguard against highly discriminative features being cast aside. In relatively uncomplicated tasks, such as classifying hand-written digits using the MNIST dataset (Lecun, Bottou, Bengio, & Haffner, 1998), the lack of supervisory information in determining the representation is not likely to present a problem as most of the salient features of the image are useful in the classification task. However in more complex tasks where a large number of salient features of the data can have nothing to do with the target task, unsupervised learning methods are not likely to efficiently generate discriminative features. On the other hand, the lesson of the utility of extracting features of the unsupervised input data pattern, demonstrated in Hinton et al. (2006) and in Raina et al. (2007) should not be ignored. In this work, we focus on the problem of learning a sparse representation of data that stakes out a compromise position: explicitly taking into account information such as the labels in a classification task, while simultaneously attempting to capture descriptive features of the input data. Our method is based on a novel probabilistic interpretation of the canonical ridge analysis (Vinod, 1976), a regularized version of canonical correlation analysis.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Face Recognition using an Affine Sparse Coding approach

Sparse coding is an unsupervised method which learns a set of over-complete bases to represent data such as image and video. Sparse coding has increasing attraction for image classification applications in recent years. But in the cases where we have some similar images from different classes, such as face recognition applications, different images may be classified into the same class, and hen...

متن کامل

A Sparse Covariance Function for Exact Gaussian Process Inference in Large Datasets

Despite the success of Gaussian processes (GPs) in modelling spatial stochastic processes, dealing with large datasets is still challenging. The problem arises by the need to invert a potentially large covariance matrix during inference. In this paper we address the complexity problem by constructing a new stationary covariance function (Mercer kernel) that naturally provides a sparse covarianc...

متن کامل

Traffic Scene Analysis using Hierarchical Sparse Topical Coding

Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...

متن کامل

Rice Classification and Quality Detection Based on Sparse Coding Technique

Classification of various rice types and determination of its quality is a major issue in the scientific and commercial fields associated with modern agriculture. In recent years, various image processing techniques are used to identify different types of agricultural products. There are also various color and texture-based features in order to achieve the desired results in this area. In this ...

متن کامل

High-dimensional Joint Sparsity Random Effects Model for Multi-task Learning

Joint sparsity regularization in multi-task learning has attracted much attention in recent years. The traditional convex formulation employs the group Lasso relaxation to achieve joint sparsity across tasks. Although this approach leads to a simple convex formulation, it suffers from several issues due to the looseness of the relaxation. To remedy this problem, we view jointly sparse multi-tas...

متن کامل

Self-regulation Mechanism of Temporally Asymmetric Hebbian Plasticity

Recent biological experimental findings have shown that synaptic plasticity depends on the relative timing of the pre- and postsynaptic spikes. This determines whether long-term potentiation (LTP) or long-term depression (LTD) is induced. This synaptic plasticity has been called temporally asymmetric Hebbian plasticity (TAH). Many authors have numerically demonstrated that neural networks are c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008